Targeter analysis

C:/Temp

Introduction

This report contains a snapshot of the specific subpopulation «r label(object$target,metadata) » == `r object$target_reference_level``.

This subpopulation is characterized by performing bivariate analyses, crossing the variable of interest (target) with potential explanatory variables. Only most relevant (highest discriminative power) may be retained.

Report contains:

  • A summary overview table with selected retained criteria with some available information such as one statistic called information value (IV) that provides an idea of the explanatory power of the criteria.
  • Detailed analyses per criteria with graphics that highlight the reation between one of the candidate criteria and the target.

It can happen that one criteria contains some missing values. If such, we will introduce value “[Missing]” in the report, eventually with a cross [x]. Also, if two few data are available, we would use a minus [-] instead of reporting value.

Summary table interpretation

Criteria are sorted by decreasing importance order, which one is determined by the IV. One modality (value) index is also computed as the proportion of target «r label(object$target,metadata)» within the modality, that is the target penetration in the subpopulation selected by the specific modality of a criteria. A value near 1 would indicate a non-discriminative modality: selecting the corresponding subpopulation would result in the overall average target penetration. On the other way, the hightest the index (>1), the more target is over-representated in the subpopulation. Low index (<1) would be interpreted as under-representation. So high index modalities are of interest as they characterize subpopulation with high target penetration.

Note that index will always be computed for segments (modalities): continuous variables are cut into buckets. If a criteria/variable has no segment with any high index, it will not be of interest at all. Also, having one segment with an index greater than 1 implies that there are counterparts segments with index lower 1.

There is no special rule to say that an index is a good one as its interpretation has to take into account the natural (average) target pourcentage (penetration). Indeed, when the penetration is low, it is easier to observe eventually high index. Think that for a global penetration of 50% of target, maximal index is 2 (and more generally, index is bounded by 1/(target penetration))

For this report, we selected a maximum of r ntop potential criteria.

For every criteria, summary table presents its possible value (modality) having the highest index along with the corresponding number of records. Pay attention that you might observe high index dealing with very restrictive sub-populations.

Population description

Target description

Target analysis :

Count Proportion target_reference_level
ABOVE50K
0 24720 0.75919
1 7841 0.24081 x

Summary view

name n_bins iv qs Max bin MaxBin Rate MaxBin N
Loading... (need help?)

Variables

AGE

Bin Count Count (%) Event rate WoE
0 (-inf, 21.50) 3130 0.096127 0.001597 5.289505
1 [21.50, 23.50) 1642 0.050428 0.015225 3.021206
2 [23.50, 25.50) 1639 0.050336 0.051251 1.770168
3 [25.50, 29.50) 3300 0.101348 0.120303 0.841317
4 [29.50, 31.50) 1749 0.053715 0.202401 0.223107
5 [31.50, 33.50) 1703 0.052302 0.223136 0.099240
6 [33.50, 35.50) 1762 0.054114 0.261067 -0.107816
7 [35.50, 37.50) 1756 0.053930 0.316059 -0.376305
8 [37.50, 40.50) 2437 0.074844 0.339762 -0.483891
9 [40.50, 43.50) 2358 0.072418 0.348601 -0.523050
10 [43.50, 49.50) 4023 0.123553 0.390753 -0.704099
11 [49.50, 54.50) 2554 0.078437 0.413861 -0.800218
12 [54.50, 61.50) 2476 0.076042 0.342084 -0.494225
13 [61.50, inf) 2032 0.062406 0.223917 0.094736
14 Special 0 0.000000 0.000000 0.000000
15 Missing 0 0.000000 0.000000 0.000000

WORKCLASS

Bin Count Count (%) Event rate WoE
0 [ Never-worked, Without-pay, ?] 1857 0.057031 0.102854 1.017661
1 [ Private] 22696 0.697030 0.218673 0.125171
2 [ State-gov, Self-emp-not-inc] 3839 0.117902 0.280542 -0.206471
3 [ Local-gov] 2093 0.064279 0.294792 -0.276024
4 [ Federal-gov, Self-emp-inc] 2076 0.063757 0.478324 -1.061487
5 Special 0 0.000000 0.000000 0.000000
6 Missing 0 0.000000 0.000000 0.000000

FNLWGT

Bin Count Count (%) Event rate WoE
0 (-inf, 188102.00) 17984 0.552317 0.249722 -0.048151
1 [188102.00, 209923.00) 3662 0.112466 0.244675 -0.021029
2 [209923.00, 242536.50) 3216 0.098768 0.223259 0.098530
3 [242536.50, 267851.00) 1629 0.050029 0.195826 0.264345
4 [267851.00, 317025.50) 2369 0.072756 0.215703 0.142640
5 [317025.50, inf) 3701 0.113664 0.244799 -0.021698
6 Special 0 0.000000 0.000000 0.000000
7 Missing 0 0.000000 0.000000 0.000000

EDUCATION

Bin Count Count (%) Event rate WoE
0 [ Preschool, 1st-4th, 5th-6th, 11th, 9th] 2241 0.068825 0.048639 1.825222
1 [ 7th-8th, 10th, 12th] 2012 0.061792 0.067097 1.483909
2 [ HS-grad] 10501 0.322502 0.159509 0.513642
3 [ Some-college] 7291 0.223918 0.190235 0.300241
4 [ Assoc-acdm, Assoc-voc] 2449 0.075213 0.255615 -0.079358
5 [ Bachelors] 5355 0.164461 0.414753 -0.803894
6 [ Masters, Prof-school, Doctorate] 2712 0.083290 0.622419 -1.648074
7 Special 0 0.000000 0.000000 0.000000
8 Missing 0 0.000000 0.000000 0.000000

EDUCATIONNUM

Bin Count Count (%) Event rate WoE
0 (-inf, 5.50) 1712 0.052578 0.051986 1.755149
1 [5.50, 8.50) 2541 0.078038 0.061000 1.585702
2 [8.50, 9.50) 10501 0.322502 0.159509 0.513642
3 [9.50, 10.50) 7291 0.223918 0.190235 0.300241
4 [10.50, 12.50) 2449 0.075213 0.255615 -0.079358
5 [12.50, 13.50) 5355 0.164461 0.414753 -0.803894
6 [13.50, inf) 2712 0.083290 0.622419 -1.648074
7 Special 0 0.000000 0.000000 0.000000
8 Missing 0 0.000000 0.000000 0.000000

MARITALSTATUS

Bin Count Count (%) Event rate WoE
0 [ Never-married, Separated] 11708 0.359571 0.047574 1.848473
1 [ Married-spouse-absent, Widowed, Divorced] 5854 0.179786 0.099419 1.055448
2 [ Married-AF-spouse, Married-civ-spouse] 14999 0.460643 0.446830 -0.934758
3 Special 0 0.000000 0.000000 0.000000
4 Missing 0 0.000000 0.000000 0.000000

OCCUPATION

Bin Count Count (%) Event rate WoE
0 [ Priv-house-serv, Other-service, Handlers-cleaners] 4814 0.147846 0.046531 1.871743
1 [ ?, Armed-Forces, Farming-fishing] 2846 0.087405 0.107871 0.964432
2 [ Machine-op-inspct] 2002 0.061485 0.124875 0.798806
3 [ Adm-clerical] 3770 0.115783 0.134483 0.713645
4 [ Transport-moving, Craft-repair] 5696 0.174933 0.219277 0.121640
5 [ Sales] 3650 0.112097 0.269315 -0.150146
6 [ Tech-support, Protective-serv, Prof-specialty] 5717 0.175578 0.411579 -0.790807
7 [ Exec-managerial] 4066 0.124873 0.484014 -1.084280
8 Special 0 0.000000 0.000000 0.000000
9 Missing 0 0.000000 0.000000 0.000000

RELATIONSHIP

Bin Count Count (%) Event rate WoE
0 [ Own-child, Other-relative] 6049 0.185774 0.017193 2.897669
1 [ Unmarried] 3446 0.105832 0.063262 1.546877
2 [ Not-in-family] 8305 0.255060 0.103070 1.015318
3 [ Husband, Wife] 14761 0.453334 0.451392 -0.953199
4 Special 0 0.000000 0.000000 0.000000
5 Missing 0 0.000000 0.000000 0.000000

RACE

Bin Count Count (%) Event rate WoE
0 [ Other, Amer-Indian-Eskimo, Black] 3706 0.113817 0.120885 0.835829
1 [ White, Asian-Pac-Islander] 28855 0.886183 0.256212 -0.082496
2 Special 0 0.000000 0.000000 0.000000
3 Missing 0 0.000000 0.000000 0.000000

SEX

Bin Count Count (%) Event rate WoE
0 [ Female] 10771 0.330795 0.109461 0.948017
1 [ Male] 21790 0.669205 0.305737 -0.328119
2 Special 0 0.000000 0.000000 0.000000
3 Missing 0 0.000000 0.000000 0.000000

CAPITALGAIN

Bin Count Count (%) Event rate WoE
0 (-inf, 4668.50) 30862 0.947821 0.204523 0.210013
1 [4668.50, inf) 1699 0.052179 0.899941 -3.344817
2 Special 0 0.000000 0.000000 0.000000
3 Missing 0 0.000000 0.000000 0.000000

CAPITALLOSS

Bin Count Count (%) Event rate WoE
0 (-inf, inf) 32561 1.0 0.24081 -2.220446e-16
1 Special 0 0.0 0.00000 0.000000e+00
2 Missing 0 0.0 0.00000 0.000000e+00

HOURSPERWEEK

Bin Count Count (%) Event rate WoE
0 (-inf, 25.50) 3943 0.121096 0.065686 1.506680
1 [25.50, 34.50) 1640 0.050367 0.078049 1.320912
2 [34.50, 39.50) 2180 0.066951 0.161009 0.502492
3 [39.50, 41.50) 15253 0.468444 0.213335 0.156692
4 [41.50, 49.50) 3083 0.094684 0.346740 -0.514847
5 [49.50, 55.50) 3730 0.114554 0.447721 -0.938364
6 [55.50, inf) 2732 0.083904 0.406296 -0.768946
7 Special 0 0.000000 0.000000 0.000000
8 Missing 0 0.000000 0.000000 0.000000

NATIVECOUNTRY

Bin Count Count (%) Event rate WoE
0 [ Holand-Netherlands, Outlying-US(Guam-USVI-etc), Dominican-Republic, Columbia, Guatemala, Mexico, Nicaragua, Peru, Vietnam, Honduras, El-Salvador, Haiti, Puerto-Rico, Trinadad&Tobago, Portugal, Laos, Jamaica, Ecuador, Thailand, Poland, South, Ireland, Hungary] 1638 0.050306 0.083028 1.253652
1 [ United-States, Scotland] 29182 0.896226 0.245836 -0.027304
2 [ ?, Cuba, China, Greece, Hong, Philippines, Germany, Canada, England, Italy, Cambodia, Yugoslavia, Japan, Taiwan, India, France, Iran] 1741 0.053469 0.304997 -0.324633
3 Special 0 0.000000 0.000000 0.000000
4 Missing 0 0.000000 0.000000 0.000000